Exploiting Semantic Proximity for Information Retrieval

نویسندگان

  • Sanjeet Khaitan
  • Kamaljeet Verma
  • Rajat Kumar Mohanty
  • Pushpak Bhattacharyya
چکیده

In this paper, we propose a method which exploits the semantic proximity of words in unrestricted natural language text to retrieve relevant documents. In order to facilitate this functionality, the system represents the documents and the query in the form of semantically relatable sets (SRS), which are a group of entities demanding semantic relations when the semantic representation of the sentence is ultimately produced. We also devise a method to augment the SRSs to further boost the performance. WordNet is used to deal with different forms of divergence between the query and the documents. In a series of experiments on TREC data, our semantic proximity based retrieval technique yields high precision with improved mean-average-precision in comparison to conventional retrieval techniques.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploiting Semantic Features for Image Retrieval at CLEF 2005

This paper presents the MIRACLE’s team approach to text-based image retrieval at ImageCLEF 2005 adhoc task. The experiments defined this year try to use semantic information sources, like semantic dictionaries or text structure. For this purpose EuroWordnet has been considered and a new algorithm to extract synonyms from the semantic database has been developed. This new algorithm implementatio...

متن کامل

Exploiting Semantic Annotations for Entity-based Information Retrieval

In this paper, we propose a new approach to entity-based information retrieval by exploiting semantic annotations of documents. With the increased availability of structured knowledge bases and semantic annotation techniques, we can capture documents and queries at their semantic level to avoid the high semantic ambiguity of terms and to bridge the language barrier between queries and documents...

متن کامل

Exploring and Exploiting Proximity Statistic for Information Retrieval Model

Proximity among query terms has been recognized to be useful for boosting retrieval performance. However, how to model proximity effectively and efficiently remains a challenging research problem. In this paper, we propose a novel proximity statistic, namely Phrase Frequency, to model term proximity systematically. Then we propose a new proximity-enhanced retrieval model named BM25PF that combi...

متن کامل

Public Transport Ontology for Passenger Information Retrieval

Passenger information aims at improving the user-friendliness of public transport systems while influencing passenger route choices to satisfy transit user’s travel requirements. The integration of transit information from multiple agencies is a major challenge in implementation of multi-modal passenger information systems. The problem of information sharing is further compounded by the multi-l...

متن کامل

Semantic Information Filtering - Beyond Collaborative Filtering

In this paper we introduce our idea of a semantic information filtering system. Contrary to traditional information filtering systems exploiting information retrieval techniques to select relevant data, we propose a workflow exploiting semantic information obtained from the web. Our system utilises the structured information crawled from the semantic web to improve the process of extracting the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006